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TYPE IX COLLAGEN AND CHIMERAS 

5 L FIELD OF THE INVENTION 

The present invention relates to novel human collagen proteins, 
polynucleotide sequences which encode these novel coUagens proteins, and to the 
use of these novel proteins in the diagnosis and treatment of disease. More 
specifically, the present invention relates to polynucleotides encoding human 
10 a3(IX) collagen and derivatives hereof, iusion proteins of type IX collagen with 
type n and/or type XI collagen subunits and derivatives, and use of these proteins 
and polynucleotides as diagnostic and therapeutic agents. 

n. BACKGROUND 

15 Collagen fibrils, proteoglycan aggregates and glycoproteins are critical 

components of the cartilage extracellular matrix that, collectively, resist 
compression and the tensile and shear forces that are generated during articulation. 
Heinegird and Oldberg, FASEBJ, J:2042-2051 (1989); Mayne and Brewton, 
Cartilage Degradation: Basic and Clinical Aspects (Woessner. J.F. and Howell, 

20 D.S., eds.) Marcel Dckkcr. Inc., New York, pp. 8M08 (1993). Mutations in 
cartilage matrix genes that affect the biosynthesis, assembly or interactions between 
these various matrix components may contribute to degradation of the cartilage 
matrix and the loss of normal cartilage function. Mutations in human collagens 
have been shown to cause a series of chondrodysplasias ranging in severity from 

25 lethal achondrogenesis type II to Stickler atthroophthaimopathy and early onset 
familial osteoarthritis (reviewed by Spranger et al. , Eur. J, Pediatr 75i:56-65 
(1994); Vikkula et aL, Ann. Medicine 26:107-114 (1994); Prockop and Kivirikko, 
Annu, Rev, Biochem. ($4:403-434 (1995)). 

Analyses of type IX collagen demonstrate this molecule is located on the 

30 surface of ^pe U coliagen-containing fibrils in hyaline cartilage and other tissues, 
including the vitreous humor (reviewed by Brewton and Mayne, Extracellular 
Matrix Assembly and Structure (Yurchenco, P.D., Birk, D.E., Mecham. R.P., eds) 
Academic Press, Inc., San Diego, pp. 129-170 (1994)). Type IX collagen is a 
heterotrimer composed of three polypeptide subunits: al(IX), a2(IX) and a3(IX), 
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that are products of distinct genes and that contain alternating non-triple*helical or 
noncoUagenous domains (NCM) and triple-helicat or collagenous domains 
(COLl-3). The three polypeptide suhunits are assembled into a mamre collagen 
molecule with the strucmre a(IX)a2(IX)a3GX) (van der Rest and Mayne, Struaure 
5 and Function of Collagen Types (Mayne, R. and Burgeson, R., eds.) Academic 
Press, Orlando, FL, pp. 195-221 (1987). In addition to type n and type DC 
collagen, hyaline cartilage from a variety of sources also contains significant 
amounts of at least three other collagen molecules, types VI, X and XI. Thomas et 
ai . Ann, Rheumat, Diseases 55:488-496 (1994); Mayne and Brcwion, Cartilage 

10 Degradation: Basic and Clinical Aspects (Woessner, J.F, and Howell, D,S., eds) 
Marcel Dekker, Inc.. New York, pp. 8M08 (1993). Type XI collagen, like type 
IX collagen, is a heterotrimer composed of three different polypeptide subunits, 
al(XD. a2(XI), and a3(XI). Collagen types XH and XIV were also isolated from 
bovine articular cartilage. Watt et aL, 7. Bioi Chem, 267:20093-20099 (1992). 

15 Native type IX collagen molecules interact with type II collagen molecules 

in a highly specific manner so that the domains NCI, COLl, NC2, C0L2 and 
NC3 lie along the surface of the collagen fibril. The interactions between type IX 
and type II collagen are stabilized by multiple covalent crosslinks derived from 
specific lysine residues. See van der Rest and Mayne, J. Biol Chem, 

20 263:1615-1618 (1988); Shimokomaki et ai, Ann. N,Y. Acad. ScL JS0:l-7 (1990); 
Wu et al, J. Biol Chem. 267:23007-23014 (1992). The periodic localization of 
type IX collagen along type II collagen fibrils can be readily visualized by rotary 
shadowing because the collagenous domain COLS and the large globular domain 
NC4 project from the surface of the fibril. Vaughan et al. , 7, Cell Biol 

25 106:99\'991 (1988); Shimokomaki et al, Ann. N.Y, Acad, ScL 580:1-7 (1990). 
In contrast, the type XI collagen heterotrimer is thought to reside in the central 
portion of the fibril. Mendler et al. J. Cell Biol 108:191-97 (1989). 

Cloning and sequencing of the human type II collagen gene and the three 
human type XI collagen genes has been reported. The complete human type 11 

30 collagen gene sequence was reported by Baldwin et al. Biochem. J. 262:521-28 
(1989) and by Su et al. Nucleic Acids Res, 17:9473 (1989). Of the three type XI 
collagen subunits, the a3(XD cham is believed to be the product of the type II 
collagen gene. Bernard etal, J. Biol Chem. 265:17159-66 (1988) discloses 



wo 97/17988 ^ PCTAJS96/18149 

cDNA sequence purportedly encoding the prool(XI) collagen. Sequence coding 
for the a2(XI) gene was reported by Kimura et al.» J, BioL Chem. 2($4:i3910-16 
(1989). 

The genes encoding the three chains of type IX collagen are excellent 

5 candidates for chondrodysplasias and degenerative disorders that affect the joints 
and/or vitreous humor because type IX collagen is a significant structural molecule 
in both of these tissues. Therefore* cloning of the genes encoding the three type 
IX collagen subunits has been the object of intensive research. Muragaki et aL, 
Eur. J. Biochem, 192:703-^ (1990), presented the complete cDNA sequence of 

10 both alternative transcripts from the human al(IX) gei^. The majority of the 
human a2(IX) collagen cDNA was reported by Perala et aL, FEBSLett. 
319:ni'i0 (1993) and completed by Warman et aL. Genomics 25:158-62 (1994). 
The complete human sequence for the o3(IX) subunit was unavailable until 
recently. As described in the concurrently fded application (U.S. Application to be 

15 assigned). Drs. R.W. Brewton and R. Mayne have identified and characterized the 
full length sequence corresponding to human a3(IX). The information contained in 
the Brewton and Mayne provisional application is mcorporated herein by reference. 

Experiments utilizing transgenic mice suggest that type IX collagen plays an 
important role in maintaining the integrity of hyaline cartilage. Animals that either 

20 express a minigene carrying a deletion in the crl(IX) chain (Nakata et al , Proc, 
Nail Acad, Set U.S.A. 90:2870-2874 (1993)) or that cany disniptcd aldX) genes 
(Fessler et al., Proc. Natl. Acad. ScL U.S.A. 9i:5070-5074 (1994)) develop 
degenerative joint disease that resembles human osteoarthritis. The importance of 
type IX collagen in human disease was verified by the identification of a mutation 

25 in COL9A2 (Muragaki et al. , submitted for publication, (1995)) that results in the 
slapping of exon 3 and that causes Multiple Epiphyseal Dysplasia (EDM2). 

in. SUMMARY OF THE INVENTION 

The present invention relates to novel collagen derivative proteins and the 
30 polynucleotide sequences which encode them. Also described herein are methods 
for diagnosis of diseases resultuig from abnormalities in collagen synthesis or 
structure. 
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One aspect of this invention is the discovery that fusion proteins of human 
type IX collagen may be produced in which a human type DC collagen subunit is 
covaiently linked to human type II collagen and/or a human type XI collagen 
subunit. In one embodiment of this invention, the fusion proteins are 
5 recombinantly produced as chimeras by linking polynucleotide coding sequence for 
a human type IX collagen subunit in frame to polynucleotide coding sequence for 
human type n collagen and/or a human type XI collagen. This chimeric coding 
sequence is inserted into an expression vector and used to transform appropriate 
host ceils. The host cells are then induced to express the chimeric coding sequence 

10 and thereby produce the chimeric collagen fusion proteins. These fusion proteins 
are useful in the treatment of collagen related diseases and conditions. 

The presem invention also relates, in part, to nucleotide sequences and 
expression vectors encoding the chimeric coUagens of the invention. 

Also disclosed herein are methods of treatment of diseases or conditions 

15 associated with abnormalities in collagen production or autoimmunity to collagen. 
Such abnormalities can result in, for example, rheumatoid arthritis, osteoarthritis, 
reactive arthritis, autoimmune hearing disease, cartilage inflanunation due to 
bacterial or viral infections (e.g. Lyme's disease), parasitic disease, bursitis, 
corneal diseases, and ankylosing spondylitis (fusion of the spine). The novel 

20 proteins of the invention are used in these methods of treating collagen related 
diseases - 



IV. BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Salt fractionation of recombinant human type IX collagen 
hcterotrimer of al(IX), 0£2(IX), and a3(IX) in nonreducing 8% sodium dodccyl 
sulfate polyacrylamide gel electrophoresis (SDS-PAGE). 

Figure 2. Salt fractionation of recombinant human type IX collagen 
heterotrimer of al(IX), cr2(IX), and a3(IX) in reducing 10% SDS-PAGE. 

30 V. DETAILED DESCRIPTION 

The presem invention relates to the polynucleotide and nucleic acid 
sequences encoding the recombinant fusion proteins of type DC collagen derivatives 
with type II collagen protein and/or type XI collagen protein and such fusion 
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proteins. Also within the scope of the invention are methods of using these 
collagen fusion proteins to treat collagen related disorders and conditions. 



A. l>efiiiitioiis 

5 The term "collagen subunit" refers to the amino acid sequence of 

one subunit of a collagen protein encoded by a single gene, as well as derivatives, 
including deletion derivatives, conservative substitutions, etc. 

A **fusioD protein" is a protein in which peptide sequences from different 
proteins are covalently linked together. 

10 The terra "chimera" or "chimeric" refers to fusion protein produced by 

operably linking polynucleotide coding sequence for two or more collagen protein 
subunits in frame and recombinantly expressing the linked coding sequence as a 
single peptide chain. 

"Active human type IX collagen" refers to the native trimeric protein 

15 complex, and may be recombinantly produced. 

The phrase "stringent conditions" as used herein refers to those hybridizing 
conditions that (1) employ low ionic strength and high temperature for washing, for 
example. 0.015 M NaCl/0.0015 M sodium citrate/0.1% SDS at SO'^C; (2) employ 
during hybridization a denaturing agent such as formamide, for example, 50% 

20 (vol/vol) formamide with 0.1% bovine serum albumin/0. 1 % Ficoll/0. 1 % 

polyvinylpyrrDlidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM 
NaCl, 75 mM sodium citrate at 42*'C; or (3) employ 50% formamide, 5 x SSC 
(0.75 M NaCl, 0.075 M Sodium pyrophosphate, 5 x Denhardt's solution, sonicated 
sahnon sperm DNA (50 g/ml). 0.1% SDS, and 10% dextran sulfate at 42**C, with 

25 washes at 42**C in 0.2 x SSC and 0.1 % SDS. 

In accordance with the invention, any nucleotide sequence which encodes 
the amino acid sequence of claimed fusion protein can be used to generate 
recombinant molecules which dirca the expression of the fusion protein. 

The term "purified" as used herem in reference to coUagens denotes that the 

30 indicated molecules are present in the substantial absence of other biological 

macroraolecules, e.g., polynucleotides, proteins, and the like. The term "purifled" 
as used herein preferably means at least 95% by weight, more preferably at least 
99.8% by weight, of the indicated biological macromolecules present (but water, 



wo 97/17988 PCT/US96/I8I49 

o 

buffers, and otlier small molecules, especially molecules having a molecular weight 
of less than 1000 daltons can be present). The term "isolated" as used herein 
refers to a protein molecule separated not only from other proteins that are present 
in the natural source of the protein, but also from other proteins, and preferably 
5 refers to a protein found in the presence of (if anything) only a solvent, buffer, 
ion, or other component normally present in a solution of the same. The terms 
"isolated" and "purified" do not encompass proteins present in their natural source. 

B. Expression Of The Collagen Fusion Pjroteins Of The Invention 

10 

1. Coding Sequences 

In accordance with the invention, polynucleotide sequences which 
encode tjrpe DC, type 11, and type XI collagen proteins, or functional equivalents 
thereof, may be used to generate recombinant DNA molecules that direct the 

15 expression of fusion proteins of type IX collagen subimit with type D collagen 
and/or type XI collagen subunits, or a functional equivalent thereof, in appropriate 
host cells. Such collagen polynucleotide sequences, as well as other 
polynucleotides which selectively hybridize to at least a pan of such collagen 
polynucleotides or their complements, may also be used in nucleic acid 

20 hybridization assays, Southern and Northern blot analyses, etc. 

Due to the inherent degeneracy of the genetic code, other DNA sequences 
which encode substantially the same or a functionally equivalent amino acid 
sequence, may be used in the practice of the invention for the cloning and 
expression of these collagen proteins. Such DNA sequences include those which 

25 are capable of hybridizing to the appropriate human collagen sequence under 
stringent conditions. 

Altered DNA sequences which may be used in accordance with the 
invention include deletions, additions or substimtions of different nucleotide 
residues resulting in a sequence that encodes the same or a functionally equivalent 

30 gene product. The gene product itself may contain deletions, additions or 

substitutions of amino acid residues within a collagen sequence, which result in a 
silent change thus producing a functionally equivalent collagen. Such amino acid 
substimtions may be made on tiie basis of similarity in polarity, charge, solubility, 
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bydrophobicity, hydiophilicity, and/or the amphipatic nature of the residues 
iavolved. For example, negatively cbarged amino acids include aspartic acid and 
glutamic acid; positively charged amino acids include lysine and arginine; amino 
acids with uncharged polar head groups having similar hydrophiliciQ^ values 

5 include the following: leucine, isoleucine, valine; glycine, alanine; asparagine, 
glutamine; serine, threonine; phenylalanine, tyrosine. 

The DNA sequences of the invention may be engineered in order to alter 
the collagen coding sequence for a variety of ends including but not limited to 
alterations which modify processing and expression of the gene product. For 

10 example, alternative secretory signals may be substimted for the native human 
secretory signal and/or mutations may be introduced using techniques which are 
well known in the art, e.^., site-directed mutagenesis, to insert new restriction 
sites, to alter glycosylation patterns, phosphorylation, etc. Additionally, when 
expressing in non-human cells, the polynucleotides encoding the coUagens of the 

15 invention may be modified in the silent position of any triplet amino acid codon so 
as to better confonn to the codon preference of the particular host organism. 

In another mbodiment of the invention, a collagen sequence may be ligated 
to a heterologous sequence to encode a Vision protein. For example, a fusion 
protein may be engineered to contain a cleavage site located between an a3(IX) 

20 collagen sequence and the heterologous protein sequence, so that the q3(IX) 
collagen may be cleaved away from the heterologous moiety. 

In a particularly preferred embodiment, chimeric fusion proteins are 
constructed by ligating sequence encoding a type DC collagen subunit. or derivative 
thereof, to sequence encoding type II collagen and/or type XI collagen subunit. 

25 One of skill in the art will recognize that several techniques are available which 
allow one to ligate all or any part of the coding sequence of a type IX collagen 
subunit to aU or any part of the coding sequences for type n and type XI collagens. 
For example, one may ligate coding sequences together at appropriately chosen 
restriction endonuclease sites. However, in order to ensure that the coding 

30 sequences of the chosen coUagens are ligated in the correct tianslationai frame, it 
may be necessary to engineer restriction sites by site specific mutagenesis. An 
even more advantageous method to join two or more polynucleotide sequences 
makes use of the polymerase chain reaction and appropriately designed primers as 
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described in Section 3. 17. 1 of Ausubel et al, . Current Protocols in Molecular 
Biology, Greene Publishing Associates and Wiley Imerscience, N.Y., (1990). 
Using this method, one of skill in the art may join two or more polynucleotide 
sequences in any configuration. 
5 In an alternate embodiment of the invention, the coding sequence of the 

coUagens of the invention could be synthesized in whole or in part, using chemical 
mediods well known in the an. Sec, for example, Caruthers et al, , Nuc. Acids 
Res, Symp, Ser. 7:215-233 (1980); Crea and Horn, Nuc. Acids Res. 9(10):2i3l 
(1980); Mattcucci and Caruthers, Tetrahedron Letters 21:719 (1980); and Chow 

10 and Kempe. Nuc. Acids Res, 9(1 2) :2i07 -2817 (1981). Alternatively, the protein 
itself could be produced using chemical methods to synthesize the desired collagen 
amino acid sequence at least in part. For example, peptides can be synthesized by 
solid phase techniques, cleaved from the resin, and purified by preparative high 
performance liquid chromatography, (e.g., see Creighton, Proteins Structures And 

15 Molecular Principles, W.H. Freeman and Co., N.Y., pp. 50-60 (1983). The 
composition of the synthetic peptides may be confumed by amino acid analysis or 
sequencing {e.g., the Edman degradation procedure; see Creighton, Proteins, 
Structures and Molecular Principles, W.H. Freeman and Co., N.Y., pp. 34-49 
(1983). 

20 In order to express the collagens of the invention, the nucleotide sequence 

encoding the collagen, or a functional equivalent, is inserted into an appropriate 
expression vector, i.e., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. 



25 2. Expression Systems 

Methods which are well known to those skilled in the art can be 
used to construct expression vectors containing a collagen coding sequence for the 
collagens of the invention and appropriate transcriptional/transiational control 
signals. These methods include in vitro recombinant DNA techniques, synthetic 

30 techniques and in vivo recombination/genetic recombination. See, for example, the 
techniques described in Maniatis et al. , Molecular Cloning A Laboratory Manual, 
Cold Spring Harbor Laboratory. N.Y. (1989) and Ausubel et al., Qtrrent 
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Protocols in Molecular Biology, Greene Publishing Associates and Wiley 
Intcrsciencc, N.Y. (1989). 

A variety of host-expression vector systents may be utilized to express a 
collagen coding sequence. These inchide but are not limited to microorganisms 
5 such as bacteria transformed with recombinant bacteriophage DNA, plasmid DNA 
or cosmid DNA expression vectors containing a collagen coding sequence; yeast 
transformed with recombinant yeast expression vectors containing a collagen 
coding sequence; insect cell systems infected with recombinant virus expression 
vectors (e.g., baculo virus) containing sequence encoding the collagens of the 

1 ) invention; plant cell systems infected with recombinant virus expression vectors 
/ (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic vir\is» TMV) or 
transformed with recombinant plasmid expression vectors (e.g, , Ti plasmid) 
containing a collagen coding sequence; or animal cell systems. Additionally, the 
collagens of the invention may be expressed in transgenic non-hiunan animals 

15 wherein the desired collagen product may be recovered from the milk of the 

transgenic animal. The expression elements of these systems vary in their strength 
and specificities. Depending on the host/vector system utilized, any of a number 
of suiuble transcription and translation elements, iicluding constimtive and 
inducible promoters, may be used in the expression vector. For example, when 

20 cloning in bacterial systems, inducible promoters such as pL of bacteriophage I, 
plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like may be used; when cloning 
in insect cell systems, promoters such as the baculovirus polyhedron promoter may 
be used; when cloning in plant cell systems, promoters derived from the genome of 
plant cells {e.g,, heat shock promoters; the promoter for the small subunit of 

25 RUBISCO; the promoter for the chlorophyll a/b binding protein) or from plant 
viruses (e.g. , the 35S RNA promoter of CaNTV; the coat protein promoter of 
TMV) may be used; when cloning in mammalian cell systems, promoters derived 
from the genome of mammalian cells (e.g., metallothionein promoter) or from 
mammalian viruses (e.g., the adenovmis late promoter; the vaccinia virus 7.5 K 

30 promoter) may be used; when generating cell lines that contain multiple copies of a 
collagen DNA. SV40-, BPV- and EBV-based vectors may be used with an 
appropriate selectid)le marker. 
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In bacterial systems a number of expression vectors may be advantageously 
selected dep^iding upon the use intended for the collagen expressed. For example, 
when large quantities of the coliagens of the invention are to be produced for the 
generation of antibodies, vectors which direct the expression of high levels of 
5 fusion protein products that are readily purified may be desirable. Such vectors 
include but are not limited to the E. coli expression vector pUR278 (Ruther et aL , 
EMBO /. 2:1791 (1983)), in which the collagen coding sequence may be ligated 
into the vector in frame with the lac Z coding region so that a hybrid AS-lac Z 
protein is produced; pIN vectors (Inouye & Inouyc, Nucleic Acids Res. 75:3101- 

10 3109 (1985); Van Heeke & Schuster, /. BioL Chem. 26^:5503-5509 (1989)); and 
the like. pGEX vectors may also be used to express foreign polypeptides as fusion 
proteins with glutathione S-transferase (GST). In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by adsorption to 
glutathione-agarose beads followed by elution in the presence of free glutathione. 

IS The pGEX vectors are designed to include thrombin or factor Xa protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST 
moiety. 

A preferred expression system is a yeast e3q)re5sion system. In yeast, a 
number of vectors contaimng constitutive or inducible promoters may be used. For 

20 a review see. Current Protocols in Molecular Biology, Vol 2, Ed. Ausubel et al, , 
Greene Publish. Assoc. & Wiley Interscience, Ch. 13 (1988); Grant et aL, 
Expression and Secretion Vectors for Yeast, in Methods in Enzymology, Ed. Wu & 
Grossman. Acad. Press, N.Y. 755:516-544 (1987); Glover, DAM Cloning, VoL 11, 
IRL Press, Wash.. D.C., Ch. 3 (1986); and Bitter, Heterologous Gene Expression 

25 in Yeast. Methods in Enzymology, Eds. Berger & Kimmel, Acad. Press, N.Y. 
752:673^84 (1987); and The Molecular Biology of the Yeast Sacduiromyces, Eds. 
Stralhem et aL . Cold Spring Harbor Press. Vols. I and 11 (1982). 

A particularly preferred system useful for cloning and expression of the 
collagen proteins of the invention uses host cells from the yeast Pichia, Species of 

30 non-Saccharomyces yeast such as Pidua pastoris appear to have special advantages 
in producing high yields of recombinant protein in scaled up procedures. 
Additionally, a Pichia expression kit is available from Invitrogen Corporation (San 
Diego, CA). 
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There are a number of methanol responsive genes in methylotrophic yeasts 
such as PicMa pastoris, the expression of each being controlled by methanol 
responsive regulatory regions (also referred to as promoters). Any of such 
methanol responsive promoters are suitable for use in the practice of the present 
5 invention. Examples of specific regulatory regions include the promoter for the 
primary alcohol oxidase gene from Pichia pastoris AOXl, the promoter for the 
secondary alcohol oxidase gene from P, pastoris AX02, the promoter for the 
dihydroxy acetone synthase gene from P. pastoris (DAS), the promoter for the P40 
gene from P. pastoris, the promoter for the catalase gene from P. pastoris, and the 
10 like. 

Typical expression in lochia pastoris is obtained by the promoter from the 
tightly regulated AOXl gene. Sec Ellis et aL, MoL Cell Biol, 5:1111 (1985) and 
U.S. Patent No. 4,855,231. This promoter can be induced to produce high levels 
of recombinant protein after addition of methanol to the culture. By subsequent 
15 manipulations of the same ceils, expression of genes for the coUagens of the 
invention described herein is achieved under conditions where the recombinant 
protein is adequately hydroxylated by prolyl 4*hydroxylase and, therefore, can fold 
into a stable helix that is required for the normal biological function of the proteins 
in forming fibrils. 

20 Another particularly preferred yeast expression system makes use of the 

methylotrophic yeast Hansenula potymorpha. Growth on methanol results in the 
induction of key enzymes of the methanol metabolism, namely MOX (methanol 
oxidase), DAS (dihydroxyacetone synthase) and FMHD (formate dehydrogenase). 
These enzymes can constimte up to 30-40% of the total cell protein. The genes 

25 encoding MOX, DAS, and FMDH production are controlled by very strong 

promoters which are induced by growth on methanol and repressed by growth on 
glucose. Any or all three of these promoters may be used to obtain high level 
expression of heterologous genes in H. potymorpha. The gene encoding a collagen 
of the invention is cloned into an expression vector under the control of an 

30 inducible H. potymorpha promoter. If secretion of the product is desired, a 
polynucleotides encoding a signal sequence for secretion in yeast, such as the 5. 
cerevisiae prepro-mating factor al, is fused in frame with the coding sequence for 
the collagen of the invention. The expression vector preferably contains an 
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auxotrophic marker gene, such as URA3 or LEU2, which may be used to 
complement the deficiency of an auxotrophic host. 

The expression vector is then used to transform H. potymorpha host cells 
using techniques known to those of skill in the art. An interesting and useful 
5 feature of H, potymorpha transformation is the spontaneous integration of up to 
100 copies of the expression vector into the genome. In most cases, the integrated 
DNA forms muitimers exhibiting a tiead-to- tail arrangement. The integrated 
foreign DNA has been shown to be mitotically stable in several recombinant 
strains, even under non-selective conditions. This phenomena of high copy 

10 integration further adds to the high productivity potential of the system. 

In cases where plant expression vectors are used, the expression of 
sequences encoding the coUagens of the invention may be driven by any of a 
number of promoters. For example, viral promoters such as the 355 RNA and 19S 
RNA promoters of CaMV (Brisson et al.. Nature 570:511-514 (1984), or the coat 

15 protein promoter of TMV (Takamatsu et ai., EMBO J, 6:307-311 (1987)) may be 
used; alternatively, plant promoters such as the small subunit of RUBISCO 
(Coruzzi et al. , EMBO J. 3: 1671-1680 (1984); Broglie et al , Science 
224:838-843 (1984); or heat shock promoters, e.g., soybean hspl7.5-E or 
hspl7.3-B (Gurley et al., Mol. Cell. Biol 6:559-565 (1986) may be used. These 

20 constructs can be introduced into plant cells using Ti plasmids. Ri plasmids, plant 
virus vectors, direct DNA transformation, microinjection, electroporation, etc. For 
reviews of such techniques see, for example, Weissbach & Weissbach, Methods for 
Plant Molecular Biology, Academic Press, NY. Section Vin, pp. 421-463 (1988); 
and Grierson & Corey, Plam Molecular Biology, 2d Ed., Blackie, London, Ch. 

25 7-9 (1988). 

An alternative expression system which could be used to express the 
collagens of the invention is an insect system. In one such system, Autographa 
califomica nuclear polyhidrosis virus (AcNPV) is used as a vector to express 
foreign genes. The virus grows in Spodoptera frugiperda cells. Coding sequence 
30 for the collagens of the invention may be cloned into non- essential regions (for 
example the polyhedron gene) of the virus and placed under conorol of an AcNPV 
promoter (for example, the polyhedron promoter). Successful insertion of a 
collagen coding sequence will result in inactivation of the polyhedron gene and 
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production of non-occluded recombinant vims (i.e., vixus lacking the proteinaceous 
coat coded for by the polyhedron gene). These recombinant viruses are then used 
to infect Spodoptera frugiperda cells in which the inserted gene is expressed, 
(e.^,, see Smith et al, J. Virol. ¥6:584 (1983); Smith, U.S. Patent No. 
5 4,215,051). 

In manunalian host cells, a number of viral based expression systems may 
be utilized. In cases where an adenovirus is used as an expression vector, coding 
sequence for the coilagens of the invention may be ligated to an adenovirus 
transcription/translation control complex, e.g., the late promoter and tripartite 

10 leader sequence. This chimeric gene may then be inserted in the adenovirus 

genome by in vitro or in vivo recombination. Insertion in a non- essential region 
of the viral genome (e.g., region El or E3) will result in a recombinant viiiis that 
is viable and capable of expressing collagen in infected hosts, (e.g. . See Logan & 
Shcnk, Proc. Natl. Acad. Set. (USA) 57:3655-3659 (1984)). Alternatively, the 

IS vaccinia 7.5 K promoter may be used. (See, e.g. , Mackett et al. , Proc. Natl. 
Acad. Sci. (USA) 79:7415- 741? (1982); Mackett et al.,J. Virol. -^9:857-864 
(1984); Panicali er fl/., Proc. Natl. Acad. ScL 79:4927-4931 (1982). 

Specific initiation signals may also be required for efficient translation of 
inserted collagen coding sequences. These signals include the ATG initiation 

20 codon and adjacent sequences. In cases v^re die entire collagen gene, including 
its own initiation codon and adjacent sequences, is inserted into the appropriate 
expression vector, no additional translational control signals may be needed. 
However, in cases where only a portion of a collagen coding sequence is inserted, 
exogenous translational control signals, including the ATG initiation codon, must 

25 be provided. Furthermore, the initiation codon must be in phase with the reading 
frame of the collagen coding sequence to ensure translation of the entire insert. 
These exogenous translational control signals and initiation codons can be of a 
variety of origins, both natural and synthetic. The efficieiK:y of expression may be 
enhanced by the inclusion of appropriate transcription enhancer elements, 

30 transcrq>tion terminators, etc. (see Bittner et at.. Methods in Enzymol. 755:516-544 
(1987)). 

One preferred expression system for the recombinant production of the 
coUagens of the invention is in transgenic non-human animals, wherein the desired 
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collagen may be recovered from the milk of the transgenic animal. Such a system 
is constructed by operably iinldi^ the DNA sequence encoding the coUagens of the 
invemion to a promoter and otter required or optional regulatoiy sequences 
capable of effecdi^ expression in mammary glands. Likewise, required or 
5 optional post-translational emymes may be produced simultaneously in the target 
cells, employing suiuble expression systems, as disclosed in, inter alia, U.S. 
Application, Serial No. 08/037,728, operable in the targeted milk protein 
producing mammary gland cells. 

For expression in milk, the promoter of choice would preferably be from 

10 one of the abundant milk- specific proteins, such as alpha Sl-casein, or b- 
lactoglobulin. For example. 5* and 3' regulatory sequences of alpha Sl-casein 
have been successfully used for the expression of the human lactoferrin cDNA, and 
similarly, the b-lactoglobin promoter has effected the expression of human 
antitrypsin gene fragments in sheep milk producing cells. Wright et at. , 

15 Biotechnology 9:830-833 (1991). In transgenic goats, the whey acid promoter has 
been used for the expression of human tissue plasminogen activator, resulting in 
the secretion of human tissue plasminogen activator in the milk of the transgenics. 
Ebert et aL , Biotechnology 9:835-838 (1991). Using such expression systems, 
animals are obtained which secrete the coUagens of the invention into milk. Using 

20 procedures well-known by those of the ordinary skill in the art, the gene encodmg 
the desired collagen chain can simply be liga^ to suitable control sequences which 
function in the mammary cells of the chosen animal species. Expression systems 
for the genes encoding the required post-translationai enzymes are constructed 
analogously. 

25 Preferably, the coUagens of the invention are expressed as secreted proteins. 

When the engineered cells used for expression of the proteins are non- human host 
cells, it is often advantageous to replace the human secretory signal peptide of the 
coUagen protein with an alternative secretory signal peptide which is more 
efficiently recognized by the host cell's secretory targeting machinery. The 

30 appropriate secretory signal sequence is particularly hnportant in obtaining optimal 
fungal expression of mammalian genes. For example, in metbylotrophic yeasts, a 
DNA sequence encoduig the in-reading frame S. cerevisiae a- mating factor pre-pro 
sequence may be inserted at the amino-terminal end of the coding sequence. The 
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aMF pre-pro sequence is a leader sequence contained in the aMF precursor 
molecule, and includes the lys-arg encoding sequence which is necessary for 
proteolytic processing and secretion (see, e.%.. Brake et ai., Proc, Nat'L Acad, 
ScL USA. 57:4642 (1984)). 

5 In addition, a host ceil strain may be chosen which modulates the 

expression of the inserted sequences « or modifies and processes the gene product in 
the specific fashion desired. Such modifications (e.g., glycosylation) and 
processing (e.g., cleavage) of protein products may be important for the function 
of the protein. Different host cells have characteristic and specific mechanisms for 

10 the post- translational processing and modification of proteins. Appropriate ceils 
lines or host systems can be chosen to ensure the correct modification and 
processing of the foreign protein expressed. To this end, eukaryotic host ceils 
which possess the cellular machinery for proper processing of the primary 
transcript, glycosylation, and phosphotylation of the gene product may be used. 

15 Such mamomalian host cells include but are not limited to CHO, VERO, BHK, 
HeLa, COS, MDCK, 293, WI38, etc. Additionally, host cells may be engineered 
to express various enzymes to ensure the proper processing of the collagen 
molecules. For example, the gene for prolyl-4-hydroxylase may be coexpressed 
with the collagen gene in the host celL 

20 For long-term, high-yield production of recombinant proteins, stable 

expression is preferred. For example, cell lines which subly express the collagens 
of the invention may be engineered. Rather than using expression vectors which 
contain viral origins of replication, host cells can be transformed with collagen 
encoding DNA controlled by ^propriate expression control elements (e.g., 

25 promoter, enhancer, sequences, transcription terminators, polyadenytation sites, 
etc.). and a selectable mailcer. Following the introduction of foreign DNA, 
engii»ered cells may be allowed to grow for 1-2 days in an enriched media, and 
then are switched to a selective media. The selecuble marker in the recombinant 
plasmid confers resistance to the selection and allows cells to stably integrate the 

30 plasmid into their chromosomes and grow to form foci which in turn can be cloned 
and expanded into cell lines. This method may advantageously be used to engineer 
cell lines which express a desired collagen. 
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A number of selection systems may be used, including but not limited to the 
heipes simplex virus thymidine kinase (Wigler ex o/,. Cell 7i:223 (1977)), 
hypoxanthine-guanine phosphoribosyltransferase (Szybaldca & Szybalski, Proc. 
Naxl Acad, ScL USA 48:2026 (1962)), and adenine phosphoribosyltransferase 
5 (Lowy et al, , Cell 22:817 (1980)) genes can be employed in tk-, hgprt- or aprt- 
ceils, respectively. Also, amimetabolite resistamre can be used as the basis of 
selection for dfafr, which confers resistance to methotrexate (Wigler et al. , NatL 
Acad, Set USA 77:3567 (1980); O'Hare et al., Proc. NatL Acad. Sci. USA 
78:1527 (1981)); gpt, which confers resistance to mycophenolic acid (Mulligan & 

10 Berg. Proc. Natl, Acad. Sci. USA 75:2072 (1981)); neo, which confers resistance 
to the aminoglycoside G-418 (Colberre- Garapin et al., J. Mol. BioL 150:1 
(1981)); and hygro, which confers resistance to hygromycin (Samcrre er al,. Gene 
50:147 (1984)). Recently, additional selectable genes have been described, namely 
trpB, which allows cells to utilize indole in place of tryptophan; hisD, which 

15 allows cells to utilize histinol in place of histidine (Hartman Sc Mulligan, Proc. 
Natl. Acad. ScL USA £5:8047 (1988)); and ODC (ornithine decarboxylase) which 
confers resistance to the ornithine decarboxylase inhibitor, 
2*(difluoromethyl)-DL-omithine, DFMO (McConlogue L., In: Current 
Commjunicmions in Molecular Biology, Cold Spring Harbor Laboratory, Ed.) 

20 (1987). 

C. Identification of Transfectants or Transformants that Express the 
Collagen Proteins of the Invention and Purification of the 
Expressed Protdns 

25 The host cells which contain the coding sequence and which express 

the biologically active gene product may be identified by at least four general 
approaches; (a) DNA-DNA or DNA-RNA hybridization; (b) the presence or 
absence of '^marker" gene functions; (c) assessing the level of transcription as 
measured by the expression of collagen mRNA transcripts in the host cell; and (d) 

30 d^ection of the gene product as measured by immunoassay or by its biological 
activity. 

In the first approach, the presence of the collagen coding sequence inserted 
in the expression vector can be detected by DNA-DNA or DNA-RNA 
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hybridization using probes comprising nucleotide sequences that are homologous to 
the collagen coding sequence, respectively, or portions or derivatives thereof. 

In the second approach, the recombinant expression vector/host system can 
be identified and selected based upon the presence or absence of certain "marker" 

5 gene functions (e.g., thymidine kinase activity, resistance to antibiotics, resistance 
to methotrexate, transformation phenocype, occlusion body formation in 
bacuiovinis, etc.). For example, if the collagen coding sequence is inserted within 
a marker gene sequence of the vector, recombinant cells containing collagen coding 
sequence can be identified by the absence of the marker gene function, 

10 Alternatively, a marker gene can be placed in tandem with the collagen sequence 
under the control of the same or different promoter used to control the expression 
of the collagen coding sequence. Expression of the marker in response to 
induction or selection indicates expression of the collagen coding sequence. 

In the third q)proach, transcriptional activity of the collagen codii^ region 

15 can be assessed by hybridization assays. For example, RNA can be isolated and 
analyzed by Northern blot using a probe homologous to the collagen coding 
sequence or particular portions thereof Alternatively, total nucleic acids of the 
host cell may be extracted and assayed for hybridization to such probes. 

In the fourth approach, the expression of a collagen protein product can be 

20 assessed immunologically, for example by Western blots, immunoassays such as 
radioimmuno*precipitatton, enzyme-linked immunoassays and the like. 

The expressed collagen of the invention, which is preferably secreted into 
the culture medium, is purified to homogeneity, e.g., by chromatography. In oi^ 
embodiment, the recombinant collagen protein is purified by size exclusion 

25 chromatography. However, other purification techniques known in the art can also 
be used, including ion exchange chromatography, and reverse-phase 
chromatography. 

D. Uses of The CoUagens of the Invention and Engineered CeU 
Lines 

30 

1. Antibody Production and Screening 

Various procedures known in the art may be used for the 
production of antibodies to epitopes of the recombinamly produced coUagens. 
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Sucb antibodies include but are not limited to polyclonal, monoclonal, chimeric, 
six^le chain. Fab fragments and fragments produced by a Fab expression library. 

For the production of antibodies, various host animals may be immunized 
by injection with a collagen protein including but not limited to rabbits, mice, rats, 
5 etc. Various adjuvants may be used to increase the immunological response, 
depending on the host species, including but not limited to Freund's (complete and 
incomplete), mineral gels such as aluminum hydroxide, surface active substances 
such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole 
limpet hemocyanin, dinitrophenol, and potentially useful human adjuvants sucb as 
BCG (bacilli Calmette-Guerin) and Corynebaaerium par\nm. 

Monoclonal antibodies to a collagen may be prepared by using any 
technique which provides for the production of antibody molecules by continuous 
cell lines in culture. These include but are not limited to the hybridoma technique 
originally described by Koehler and Milstein. {Nature, 256:495-497 (1975)), the 
human B-cell hybridoma technique (Kosbor et aL, Immunology Today, 4.12 
(1983)); Cote et al.^ Proc, Natl Acad. ScL, 50:2026-2030 (1983) and the EBV- 
hybridoma technique (Cole et al. , Monoclonal Antibodies and Cancer Therapy, 
Alan R. Uss, Inc.» pp. 77-96 (1985). In addition, techniques developed for the 
production of "chimeric antibodies'* (Morrison et aL, Proc, Natl. Acad, ScL 
Si:6851-6855 (1984); Ncuberger et al.. Nature. 572:604-608 (1984); Takeda et 
aL, Nature 314:452-454 (1985)) by splicing the genes from a mouse antibody 
molecule of appropriate antigen specificity together with genes from a human 
antibody molecule of apprq3riate biological activity can be used. Alternatively, 
techniques described for the production of single chain antibodies (U.S. Patent No. 
4,946,778) can be adapted to produce a collagen- specific single chain antibodies. 

Antibody fragments which contain deletions of specific binding sites may be 
generated by known techniques. For example, such fragments include but are not 
limited to: the F(ab*)2 fragments which can be produced by pepsin digestion of the 
antibody molecule and the Fab fragments which can be generated by reducing the 
disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries 
may be constructed (Huse et aL, Science 2^6:1275-1281 (1989)) to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity for the 
collagen of interest. 
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2. Therapeutic Uses Of The Collagen Proteins Of The 
Invention 

Another aspect of the invention is to provide methods of 
treating immune system-mediated diseases usii^ the collagen proteins of the 
5 invention. The terms "treatment" or "treating" as used herein with reference to a 
disease refer both to prophylaxis and to the amelioration of symptoms already 
present in an individual. It will be appreciated by the person of ordinary skill in 
the art that a treatment need not be completely effective in preventing the onset of 
a disease or in reducing the symptoms associated with the disease. Any reduction 
of the severity of symptoms, delay in the onset of symptoms, or delay in the 
progression of severity of symptoms is desirable to a patient Persons at risk of 
developing a given immune system-mediated disease may be treated 
prophylacticaliy based on any of a variety of facK>rs suggesting the possible onset 
of an immune system- mediated disease, e.^., family history, genetic markers, 
early symptoms, and the like. 

Immune system-mediated diseases that may be treated by the subject 
methods include, but are not limited to, e.g., rheumatoid arthritis, osteoarthritis, 
reactive arthritis, autoimmune hearing disease, cartilage inflammation due to 
bacterial or viral infections (e.g. Lyme's disease), parasitic disease, bursitis, 
corneal diseases, and ankylosing spondylitis (fusion of the spine). The subject 
methods of the invention comprise the step of administering an effective amount of 
a composition of the invention, e.g., collagens, collagen derivatives. Preferred 
compositions for use in treating specific immune system-mediated diseases are 
fusion proteins of type DC collagen, preferably chimeras of a type IX collagen 
subunit with type n collagen and/or type XI collagen, and derivatives and subunits 
thereof, as well as as described in the preceding sections. In a preferred 
embodiment of the subject methods, the composhions administered to the subject 
con^rise variably glycosylated collagens. The compositions administered in the 
subject methods are administered so that the active components, Le, , collagens 
and/or collagen derivatives, contact the lymphoid tissue of the gut, e.g, , Peyer's 
patches or other similar sites, so that immune tolerance is induced. Such 
administration may be ejected, by among many possible methods, through the use 
of formulations comprising the subjected compositions that are designed for oral 
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administradon, i.e., the active components are not destroyed or inactivated in the 
mouth, stomach, or other portions of the digestive system prior to contacting the 
appropriate gut lymphoid tissue. The treatment methods of the invention may also 
comprise the steps of administering additional phannaceutical compounds for the 
5 treatment of immune system-mediated diseases, such as anti-inflammatory agents 
and the like. 

The dosage at which the subject con^K>sitions are administered may vary 
within a wide range and will dq)end on various factors such as for example the 
severity of the inflammation, the age of the patient, etc., and may have to be 

10 individually adjusted. As a possible range for the amount of collagen(s) and/or 
collagen(s) derivatives which may be administered per day may be in the range of 
from about 0.001 mg to about 200 mg. Preferably, the amount of collagen and/or 
collagen derivatives administered is low, thereby favoring the induction of immune 
tolerance by suppression rather than clonal anergy. The pharmaceutical 

IS compositions containing the collagen(s) and/or conagen(s) derivatives may suitably 
be formulated so that they provide doses within these ranges either as single dosage 
units or as multiple dosage units. 

The optima] dosage of tolerance inducing compositions for use in the 
methods of the invention will vary in accordance with a number of factors. The 

20 terms "dosage" and "dose** as used herein, unless indicated otherwise, may refer 
not only to a single administration of a composition but may be used to refer to the 
total amount of a given pharmaceutical composition administered over a selected 
period of time and involving multiple individual administrations. Factors affecting 
the optimal dosage include the choice of collagen molecule or molecules (and/or 

25 collagen derivatives) administered to the patient, the specific mucosa binding 
molecules selected, the age of the patient, the severity of the disease, other 
diseases that may be present in the patient, inert components in the formulation, 
adjuvants, and the like. There may be considerable variation in the range of 
dosages that are effective in treating a given immune disorder. Different dosages 

30 of the same pharmaceutical composition may produce the desired tolerance effect 
by different mechanisms. Although the operation of the invention is not dependent 
upon a particular theory of operation, the person of ordinary skill in the art will 
better imderstand the invention and provide additional embodiments by appreciating 
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that there are believed to be two primary mechanisms by which oral tolerance is 
mediated. Oral tolerance may be mediated by active cellular suppression in which 
regulatory T cells that suppress the activation and proliferation of lymphocytes 
specific for tolerized antigen. Another mechanism of oral tolerance induction is 

5 clonal anergy in which T lymphocytes having a suitable receptor are rendered 
unresponsive. Generally active suppression tolerance is favored by "low" doses of 
a tolerizing antigen and clonal anergy is favored by comparatively ''high" doses of 
the same tolerizing antigen. A review of the principles and techniques for oral 
tolerance induction can be found in Werner et aL, Annual Review of Immunology, 

10 pp. 809-835, Annual Reviews (1994). 

The subject compositions may be fomiulated as phannaceutical 
compositions so as to be adapted for certain types of administration to mucosal 
surfaces, e.g., oral, topical, and inhalation. The preferred form of formulation for 
oral administration in a form where the collagen and/or collagen derivatives in the 

15 composition come into contact with intesdnal lymphoid tissue, e.g., Peyer's 
patches. Compositions of the invention may be administered topically, orally, 
intranasally, by ii\}ection or by inhalation in the form of a phannaceutical 
compositions comprising a collagen(s) and/or coUagen(s) derivatives in the form of 
the original compound or optionally in the form of a pharmaceutically acceptable 

20 salt thereof, in association with a pharmaceutically acceptable carrier which may be 
a solid, semi-solid or liquid diluent or an ingestible capsule, and such preparations 
comprise a further aspect of the invention. The coUagen(s) and/or collagen(s) 
derivatives and mucosa binding collagen conjugates may also be used with earner 
material. As examples of phannaceutical preparations may be mentioned tablets, 

25 drops such as nasal drops, preparations for topical application such as ointments, 
jellies, creams and suspensions, aerosols for inhalation, nasal spray, liposomes, 
etc. Usually the collagen(s) and/or coliagen(s) derivatives will comprise between 
0.05 aiKi 99%, or between 0.1 and 99% by weight of the preparation, for example 
between 0.5 and 20% for preparations intended for injection and between 0.1 and 

30 50% for preparations intended for oral administration. 

To produce phannaceutical preparations in this form of dosage units for 
oral application containing a compound of the invention the active ingredient may 
be mixed with a solid, pulverulent carrier, for example lactose, saccharose. 
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soibitol, nuumitol, a starch such as potato starch, com starch, amyiopectin, 
laminaria powder or citrus pulp powder, a cellulose derivative or gelatine and also 
may include lubricants such as magneshim or calcium stearate or a Carbowax" or 
other polyethylene glycol waxes and are compressed to form tablets or cores for 
5 dragees. If dragees are required, the cores may be coated, for example, with 
concentrated sugar solutions which may contain gum arable, talc aid/or titanium 
dioxide, or alternatively with a film forming agent dissolved in easily volatile 
organic solvents or mixtures of organic solvents. DyestufEs can be added to these 
coatings, for example, to distinguish between diffcrem contents of active substance. 

10 For the preparation of soft gelatine capsules consisting of gelatine and. for 

example, glycerol as a plasticizer, or similar closed capsules, the active substance 
may be admixed with a Carbowax** or a suitable oil as e.g. sesame oil, olive oil, 
or arachis oil. Hard gelatine capsules may contain granulates of the active 
substance with solid, pulverulent carriers such as lactose, saccharose, sorbitol, 

15 mannitol, starches (for example) potato starch, com starch or amylopectin), 
cellulose derivatives or gelatine, and may also include nuignesium stearate or 
stearic acid as lubricants. 

The compositions of the invention may also be formulated so as to provide 
a sustained release. By using several layers of the active drug, separated by slowly 

20 dissolvii^ coatings sustained release tablets may be obtained. Another way of 
preparing sustained release tablets is to divide the dose of the active drug iiuo 
granules with coatings of different thicknesses and compress the graimles into 
tablets together with the carrier substance. The collagen(s) and/or coUagen(s) 
derivatives and mucosa binding collagen conjugates may also be incorporated in 

25 slowly dissolving tablets made, for instance, of fat and wax substances or evenly 
distributed in a tablet of an insoluble substance such as a physiologically inert 
plastic substance. 

In order to obtain dosage units of oral preparations - tablets, capsules, etc. 
- which are designed so as to prevent release of and possible decomposition of the 
30 active substance in the gastric juice, the tablets, dragees etc. may be enteric- 
coated, that is provided with a layer of gastric juice*resistant enteric fifan or coating 
having such properties that it is not dissolved at the acidic pH in the gastric juice. 
Thus, the active substance will not be released until the preparation reaches the 
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intestines. As example of such known enteric coatings may be mentioned cellulose 
acetate pbtbalate, hydroxypropyl-methylceliulose phthalates such as those sold 
under the trade names HP 55 and HP SO, and Edragit^ and Eudragit^. 

Liquid preparations for oral a^lication may be in the form of elixirs* 
5 syrups or suspensions, for example solutions containing from about 0.1% to 20% 
by weight of active substance, sugar and a mixture or ethanol, water glycerol, 
propylene glycol and optionally aroma, saccharine and/or carboxymethylcellulose 
as a dispersing agent. 

10 VI. EXAMPLES 

The invention will be further understood by reference to the following 
examples, which are intended to be purely exemplary of the invention. 

A. EXAMPLE 1: Expression Of Recombinant a3(IX) Collagen 
15 Subunit In Pichia Pastoris 

PGR primers for the amplification of the oc3(IX) collagen cDNA 

coding sequence from the plasmid p545 and the cDNA library clone RB410 are 

prepared. The primers are designed such that they introduce an Eco RI site at the 

5' and the 3* termini of the a3(IX) collagen coding sequence, and a unique 

20 restriction site is used to join the two halves of the coding sequence found in these 

two clones. 

A Primer 1 and a Primer 2 are used to amplify the mamre amino-terminal 
coding sequence for oc3(IX) collagen from plasmid p545 using standard PCR 
conditions as described m Ausubel et al., Current Protocols in Molecular Biology, 
25 Greene Publishing Associates and Wiley Interscience, N.Y. (1990). A Primer 3 
and Primer 4 are used to ampliiy the remaining cDNA coding sequence, inchiding 
the stop codon, from the cDNA clone RB410 as described above. The resulting 
PCR product is digested with the chosen unique restriction endonuclease and with 
EcoRI. 

30 Commercially available expression vector pPIC9 (Invitrogen, San Diego, 

CA), which directs secreted expression in Pichia pastoris is digested with 
restriction endonuclease EcoR I, followed by calf intestinal phosphatase 
(Pharmacia), and then heat denamration at TO^'C for 5 minutes. The digested PCR 
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products and the pPIC9 vector are gel purified as described in Example 3 and a 
tbtee-way ligation is performed. After transformation into competent Escherichica 
coli, correctly ligated plasmids are identified by restriction analysis and confirmed 
by sequencing using the commercially available Pichia sequencing primers 
5 (Invitrogen, San Diego, CA). 

The a3(IX) Pichia expression vector is linearized and used to transfonn 
spheroblasts of a his4 Pichia pastoris strain which also expresses prolyl-4* 
hydroxylase. Transformants are identified on histidine deficient media and are 
confumed by assaying for the loss of the AOXl gene by slow growth on methanol 
10 media. Expression of the a3(IX) gene is induced by growing cells on methanol as 
the sole carbon source. a3(IX) collagen subunit protein Is secreted into the growth 
medium ami subsequently purified using standard centriiugation, filtration, and 
chromatogr^hic techniques. 

15 B« EXAMPLE 2: Expression Of Tkimeric Human Type K Collagen 

In Pkhia Pastoris 

In a similar manner, the Pichia pastoris stram which produces 
a3(IX) collagen subunit is engineered to coexpress the al(IX) and q2(IX) collagen 
subunits in tiie same cell. 

20 

C EXAMPLES: Expression Of Trimeric Human Type K CoUagen 
In Spodoptera Erugiperda Sf9 Insect Cells 

Three recombinant viruses were generated by cotransfection of 

recombinant al(IX), aZQX), and Qr3(IX) constnicts and a modified Ataographa 

25 califormca nuclear polyhedrosis virus DNA into Spodoptera frugiperda Sf9 msect 
cells using the Baculogold transfection kit (Pharmingen). The sequences used to 
construct the three a(IX} chains are disclosed in van der Rest and Mayne, Structure 
and Function of Collagen Types (Mayne, R. and Burgeson, R. eds.) Academic 
Press, Orlando, FL. pp. 185-221 (1987), The resultant viral pools were collected. 

30 anq)lified, and plaque- purified as described by Gnienwald, S. and Heitz, J„ 

Baculovirus Expression System: Procedures d Methods Manual, Pharmingen, San 
Diego. CA (1993). 
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Spodopterafhigiperda Sf9 insect cells were cultured in TNH-FH medium 
supplemented with 10% fetal bovine senim (BioClear) as monolayers at ll^'C, 
^proximately, 5 x 10^ insect cells were infected with recombinant human al(IX), 
cx2(IX), and a3(IX) constructs and with the a0 virus for human prolyl 
5 4-hydroxylase (manuscript in preparation). Viruses for the type DC collagen a- 
chains were used in 2 to 3 fold excess over the prolyl 4*hydroxylase virus. 
Ascorbate of 80 /xg/ml was added to the culture medium daily. The culnire media 
was removed 72 hours after infection, and the cell layer was washed once with a 
solution of 0.15 NaCl and 0.02 M phosphate at pH of 7.4. The cells were 

10 harvested by scraping them in 1.4 ml of ice cold solution of 0.5 acetic acid, 0.75 
M NaCl, lOmM EDTA. and ImM PMSF at pH of 2.5. The cells were then 
homogenized, and centnftiged at ISOOO x g for 20 minutes. Supernatant was 
precipitated with NaCl at the fmal concentration of 1.2 M by mixing the sample 
for 12 hours at 4**C. The precipitate was centriiuged at 15000 x g for 20 minutes 

IS at 4''C. The resulting pellet was dissolved in 500 /il of cold SOmM acetic acid for 
three hours at 4^C. A sample of 15 ^1 was analyzed by either non-reducing or 
reducing SDS-PAGE followed by staining with Coomassie Brilliant Blue. The 
materia] was also digested with pepsin for 4 hours at 22^C, and the thermal 
stability of the pepsin-resistant recombinant type IX collagen was measured by 

20 rapid digestion with a mixture of trypsin and chymotrypsin as described in Buckner 
et al. Ami Biochem 110:360-368 (1981). The resulting material was analyzed on 
reducing SDS-PAGE followed by Western Blotting with an antibody to a 
triple-helical collagen. 

The results indicated expression of human type DC collagen as a 

25 heterotrimer of about 300 kDA (Figure 1) consisting of equal amounts of al(IX), 
cx2(IX), and a3(IX) chains as indicated in Figure 2. The thermal stability of the 
recombinant human type DC collagen was analyzed after a brief protease digestion. 
The thennal stability of the recombinant human type DC collagen was over 40''C. 



30 D. EXAMPLE 4: Clonhig And Expression Of A Chimeric Type 

n/DC/XI Collagen Molecule 

The a3(DC) Pichia expression vector described above is modified so 
as to direct expression of a chuneric type n/type IX/type XI molecule. 
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Specifically, the vector is cleaved either 5* or 3* to the a3(IX) collagen coding 
sequence and coding sequence for type n collagen is inserted in-frame. 
Additionally, the vector may be again cleaved, either S\ 3\ or between the type n 
and type DC coding sequence, and sequence encoding type XI collagen is also 
5 inserted in the correct reading frame so as to express a chimeric type II/DC/XI 
collagen molecule. Transformants of competent £. coli are screened for plasmids 
with the desired orientation by restriction digestion and confirmed by sequencing as 
described above. 

Various modifications of the invention, in addition to those shown and 
10 described herein, will become apparent to those skilled in the art from the 

foregoing description. Such modifications are intended to fall within the scope of 
the appended claims. It is also to be understood that all base pair sizes given for 
nucleotides are approximate and are used for purposes of description. 

All references cited herein are hereby incorporated by reference in their 
15 entirety. 
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1. A fusion protein comprising a human type IX collagen linked to 
heterologous peptide sequence. 

5 

2. The fusion protein of Claim 1 wherein the heterologous peptide 
sequence comprises a type n collagen. 

3. The fusion protein of Claim 1 wherein the heterologous peptide 
10 sequence comprises a type XI collagen. 

4. The fusion protein of Claim 1 wherein the heterologous peptide 
sequence comprises type 11 and Qrpe XI collagen. 

15 S. A method for producing recombinant human fusion protein 

comprising: 

(a) culturing a host cell tiansfonned with the recombinant DNA 
expression vector which expresses said fusion protein; and 

(b) recovering die fusion protein from the cell culture. 

20 

6. A protein comprising human recombinant type IX collagen. 



7. A method for producing recombinant human type IX collagen 
comprising: 

25 (a) culturing a host cell transformed with the recombinant DNA 

expression vector which expresses said type IX collagen; and 

(b) recovering the 9pe DC collagen from the cell culture. 
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